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SORTING IMAGES FOR IMPROVED DATA ENTRY PRODUCTIVITY 
FIELD OF THE INVENTION 

The present invention relates generally to methods 
and apparatus for data encoding, and specifically to 
5 methods for enhancing the efficiency of data key-in. 

BACKGROUND OF THE INVENTION 

There are many applications for data input from a 
hard copy to a computer system that use automated Optical 
Character Recognition (OCR) , followed by manual 

10 verification of the OCR results. Human operators perform 
the verification step, either by reviewing all the fields 
in the original document, and correcting errors and 
rejects discovered in the OCR results, or by viewing and 
correcting only the characters or fields that have a low 

15 OCR confidence level. For example, US Patent 5,455,875, 
to Chevion et al., whose disclosure is incorporated 
herein by reference, describes a method for organizing 
data on -a- computer screen so as to improve productivity 
of human operators in verifying OCR results. The method 

20 is implemented in document processing systems produced by 
IBM Corporation (Armonk, New York) , in which the method 
is referred to as "SmartKey." Such OCR-based methods are 
useful as long as the OCR results are reliable, so that 
there are relatively few errors that the operator must 

25 correct. 

Despite advances in the robustness of optical 
character recognition (OCR) , there are still many 
document processing applications that do not achieve the 
requisite level of reliability for OCR-based processing. 
30 In these applications, it is necessary for human 
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operators to key data into a computer terminal. The 
operator views either the paper document itself or an 
image of the document (or of a selected part of the 
document) and codes the document contents by typing 
letters and numbers into the terminal. This sort of 
manual coding is costly, and there is a need for 
techniques that can be used to reduce the amount of time 
that the operator spends keying in each document entry. 
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SUMMARY OF THE INVENTION 

In preferred embodiments of the present invention, 
an operator codes entries that have been written in a 
given field in a plurality of form documents. To 
5 facilitate this process, an image of the given field is 
extracted from each of the documents. The field images 
are then sorted into an order according to a measure of 
similarity between the images, so that similar images are 
grouped together. The similarity measure is chosen so 

10 that the images in a given group are likely to have the 
same code, due to shared shape characteristics and/or 
other image features. 

The field images are displayed to the operator one 
after another on a computer terminal. When the operator 

15 views the first image in a group of similar images, he or 
she keys in the appropriate code for the entry, typically 
by typing letters and/or numbers into the terminal as in 
conventional key-in systems. Because of the similarity 
between the entries in the group, it is likely that 

20 subsequent images in the same group will have the same 
code as the previous image. Therefore, after the 

operator has keyed in the code of the first image, this 
code is displayed alongside each of the subsequent images 
as they are presented to the operator. Then, instead of 

25 re-keying the entire code for each subsequent image, the 
operator can simply confirm that the displayed code is 
the correct one, preferably by a single input action, 
such as a single keystroke (such as <ENTER>) or mouse 
click. When the operator determines that a given entry 

30 should not have the same code as the preceding one, he or 
she keys in the new code for this entry. This new code 
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preferably becomes the one that is displayed alongside 
subsequent images for operator confirmation. 

This technique of sorting images for data key-in 
saves the operator considerable time and reduces the 
5 probability of error. It is particularly useful in 
large-scale key-in of form documents, in which the same 
word is likely to appear in a given field on many of the 
forms. It can also be used, however, in other image 
processing applications, in which an operator must code a 
10 large number of similar images. 

There is therefore provided, in accordance with a 
preferred embodiment of the present invention, a method 
for data entry, including: 

receiving a plurality of images; 
15 sorting the images into an order responsive to a 

measure of similarity between the images, so as to group 
similar images together in the order; 

presenting to an operator a first image among the 
images in the order, and receiving an input from the 
20 operator specifying a code to be assigned to the first 
image ; 

presenting to the operator a second image, 
subsequent to the first image among the images in the 
order, along with the code specified by the operator for 
25 assignment to the first image; and 

assigning the code to the second image responsive to 
a single input action by the operator, indicating that 
the second image is to be assigned the same code as the 
first image. 

30 In a preferred embodiment, the plurality of the 

images include entries in fields in one or more form 
documents, wherein the one or more documents include 
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multiple fields, and wherein receiving the plurality of 
the images includes extracting the entries from a 
selected one of the fields in the documents. 

Typically, the images include alphanumeric 
5 characters, and the code includes alphanumeric codes 
input by the operator corresponding to the alphanumeric 
characters appearing in the first image. Preferably, 
sorting the images includes applying optical character 
recognition (OCR) to the images so as to associate OCR 

10 codes with the characters, and grouping the images 
according to the OCR codes. Most preferably, grouping 
the images includes finding at least an approximate match 
between a first string of the OCR codes associated with 
the characters in the first image and a second string of 

15 the OCR codes associated with the characters in the 
second image. 

Preferably, the single input action includes a 
single keystroke on a keyboard. Further preferably, 
receiving the input from the operator specifying the code 

20 to be assigned to the first image includes receiving a 
first input specifying a first code, and the method 
includes, when the second image is not to be assigned the 
same code as the first image, receiving a second input 
from the operator specifying a second code to be assigned 

25 to the second image. Most preferably, the method 

includes presenting to the operator a third image, 
subsequent to the second image among the images in the 
order, along with the second code specified by the 
operator, and assigning the second code to the third 

30 image responsive to the single input action by the 
operator . 
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There is also provided, in accordance with a 
preferred embodiment of the present invention, data entry 
apparatus, including: 

a memory, arranged to store a plurality of images; 
5 a display; 

a user input device; and 

an image processor, arranged to sort the images in 
the memory into an order responsive to a measure of 
similarity between the images so as to group similar 

10 images together in the order, and further arranged to 
drive the display so as to present to an operator a first 
image among the images in the order, and to receive an 
input from the operator via the user input device 
specifying a code to be assigned to the first image, and 

15 still further arranged to drive the display to present to 
the operator a second image, subsequent to the first 
image among the images in the order, along with the code 
specified by the operator for assignment to the first 
image, and to assign the code to the second image 

20 responsive to a single input action applied to the user 
input device by the operator indicating that the second 
image is to be assigned the same code as the first image. 

There is additionally provided, in accordance with a 
preferred embodiment of the present invention, a computer 

25 software product, including a computer-readable medium in 
which program instructions are stored, which 
instructions, when read by a computer, cause the computer 
to receive and sort a plurality of images into an order 
responsive to a measure of similarity between the images 

30 so as to group similar images together in the order, and 
further cause the computer to present to an operator a 
first image among the images in the order, and to receive 
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an input from the specifying a code to be assigned to the 
first image, and still further cause the computer to 
present to the operator a second image, subsequent to the 
first image among the images in the order, along with the 
5 code specified by the operator for assignment to the 
first image, and to assign the code to the second image 
responsive to a single input action by the operator 
indicating that the second image is to be assigned the 
same code as the first image. 
10 The present invention will be more fully understood 

from the following detailed description of the preferred 
embodiments thereof, taken together with the drawings in 
which : 

BRIEF DESCRIPTION OF THE DRAWINGS 

15 Fig. 1 is a schematic, pictorial illustration of a 

system for data key-in, in accordance with a preferred 
embodiment of the present invention ; 

Fig. 2 is a schematic representation of a computer 
terminal display that is presented to a key-in operator, 

20 in accordance with a preferred embodiment of the present 
invention; and 

Fig. 3 is a flow chart that schematically 
illustrates a method for data key-in, in accordance with 
a preferred embodiment of the present invention. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Fig. 1 is a simplified pictorial illustration 
showing a system 20 for coding of information, in 
accordance with a preferred embodiment of the present 
5 invention. System 20 receives a document 22 for 

extraction of information therefrom. The document 

preferably comprises a preprinted form having fields that 
are filled in with contents. The contents typically 
comprise handwritten alphanumeric characters, although 

10 they may comprise symbols or marks of other types, 
generated by hand or by machine. 

In order to extract information from document 22, a 
scanner 24 first captures an image of document 22 and 
conveys the corresponding image data to an image 

15 processor 26, which typically comprises a suitable 
general-purpose computer. Alternatively, the document 
image is input to the processor from another source, 
typically in electronic form. Processor 26 receives a 
large number of such images, stores them in a memory 36, 

20 and then processes and sorts them, as described in detail 
hereinbelow. The sorted images are displayed on a 
monitor screen 28 for viewing by an operator 30. The 
operator codes the images by keying in alphanumeric 
characters, using a keyboard 32. A pointing device, such 

25 as a mouse 34, is typically also provided. 

Although for simplicity of illustration, all of the 
image sorting, display and other computerized functions 
of system 20 are shown as being performed by the single 
processor 26, it will be apparent to those skilled in the 

30 art these functions may easily be distributed among a 
number of different computers and/or terminals. These 
functions are preferably performed using software running 
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on processor 26 and possibly on such other computers 
and/or terminals. The software may be supplied on 
tangible media, such as diskettes or CD-ROM, and loaded 
into the processor. Alternatively, the software may be 
5 downloaded to the processor via a network connection or 
other electronic link. 

Fig. 2 is a schematic representation of screen 28, 
presenting information for coding by operator 30, in 
accordance with a preferred embodiment of the present 

10 invention. In this exemplary embodiment, document 22 is 
one of a large number of forms that include a field in 
which a person has written the make of a car. {For 
example, the forms may be copies of traffic tickets 
issued by policemen.) Processor 26 extracts images of 

15 the car make field from the forms in memory 36, and 
displays the handwritten car make from one of these forms 
in a display window 40. Alphanumeric characters 

corresponding to the car make in window 40 are entered in 
a data window 42. The characters may be typed into 

20 window 42 by operator 30, or they may alternatively be 
presented by processor 26 for confirmation by the 
operator, as described hereinbelow. The operator 

confirms that the data in window 42 represent the correct 
coding of the image in window 40 by a single action, such 

25 as a keystroke (typically <ENTER>) on keyboard 32 or a 
mouse click on an on-screen button 44, or on another 
appropriate on-screen control. 

Fig. 3 is a flow chart that schematically 
illustrates a method for data coding by operator 30, in 

30 accordance with a preferred embodiment of the present 
invention. After processor 26 has stored a large number 
of forms of the same type in memory 36, it selects one of 
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the fields in the forms to process, at a selection step 
50. For example, the processor may select the car make 
field, as illustrated in Fig. 2, either autonomously or 
under the control of operator 30. The processor extracts 
5 the contents of the selected field from each of the form 
images, at an extraction step 52. 

Before presenting the field images to operator 30, 
processor 26 sorts and groups the images based on mutual 
similarity, at a sorting step 54. There are many methods 

10 known in the art for measuring similarities between pairs 
and groups of images, and substantially any such method 
may be used at this step. Preferably, each of the field 
images is analyzed by OCR to give a respective string of 
characters, and a string-matching algorithm is applied to 

15 sort the images. Even when the OCR results are 

unreliable or incomplete, they are generally still 
adequate for error-tolerant string matching, as is known 
in the art. In the example shown in Fig. 2, the 
handwritten entry "chewy" has been found to be a close 

20 enough match to "Chevrolet" to be placed in the same 
group. Misspellings and undecipherable characters are 
likewise preferably ignored in the string matching 
process. In longer entries, containing multiple words, 
images may be grouped together when they are found to 

25 contain one or more words in common. 

Alternatively, morphological criteria may be used in 
sorting the field images. For example, lines, arcs and 
loops may be identified in the images, and used in 
constructing a vector representing the image features. 

30 The images can then be grouped and sorted by measuring 
vector distances between them. Further alternatively, 
the images may be compared and sorted by pixelwise 
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correlation with one another or with one or more 
templates. This latter technique is useful particularly 
when the images to be coded are pictures of people or of 
objects, rather than containing characters and words. 
5 Other methods of sorting and grouping images will be 
apparent to those skilled in the art. 

Processor 26 presents the field images in window 40 
on display 28 in their sorted order, beginning with a 
first image I 0 , at an initial coding step 56. Operator 30 

10 keys in the appropriate alphanumeric characters to code 
the first image, and then confirms that the coding (which 
appears in window 42) is correct. In the present 
example, assume that the first image was coded as 
"Chevrolet." Processor 26 then displays the next image Ii 

15 in window 40, alongside the code assigned to image I 0 in 
window 42, at a display step 58. This is the situation 
pictured in Fig. 2. Operator 30 decides whether the code 
appearing in window 42 is the correct one for the current 
entry, at a decision step 59. If so, the operator can 

20 simply confirm the code with a single stroke, either by 
clicking mouse 34 or pressing an appropriate key on 
keyboard 32, at a confirmation step 60. If the code in 
window 42 is not the appropriate one for the current 
entry, the operator keys in the new code, at a key-in 

25 step 62. 

Processor 26 then goes on to the next image, I 2 , and 
repeats steps 58 through 62. If a new code was entered 
in the previous iteration at step 62, this is preferably 
the code that is displayed in window 42 during the next 
30 iteration at step 58. Steps 58 through 62 are repeated 
iteratively in this manner for the entire collection of 
field images, until all of the images have been coded. 
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Although the preferred embodiment described above 
deals with key-in of words appearing in a plurality of 
form documents, the principles of the present invention 
may also be applied in other image processing and data 
5 key-in applications in which an operator must code a 
large number of similar images. For example, the method 
described above may also be used in coding the contents 
of multiple fields of the same type that appear on a 
single document. Furthermore, as noted above, the 

10 principles of the present invention may be applied not 
only to coding of alphanumeric characters, but also to 
coding other types of symbols, as well as images of 
people and objects. For example, in an alternative 
embodiment of the present invention, the system and 

15 method described above can be applied to face 
identification . 

It will thus be appreciated that the preferred 
embodiments described above are cited by way of example, 
and that the present invention is not limited to what has 

20 been particularly shown and described hereinabove. 
Rather, the scope of the present invention includes both 
combinations and subcombinations of the various features 
described hereinabove, as well as variations and 
modifications thereof which would occur to persons 

25 skilled in the art upon reading the foregoing description 
and which are not disclosed in the prior art. 
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